Discovering regularities in Big Data is nowadays a crucial task in many different applications, from bioinformatics to cybersecurity. To this aim, a promising approach consists in performing data clustering with Local Metric Learning, i.e. trying to discover well-formed (compact and populated) clusters and, at the same time, a suitable subset of features corresponding to the subspace where each cluster lies. In this paper, we propose a novel evolutionary agent based clustering algorithm, where agents act as individuals of an evolving population, each one performing a simple cluster analysis task on a different subset of patterns randomly drawn from the entire dataset. A customized genetic algorithm orchestrates the evolution of a population of such agents. This approach is able to reveal clusters that are not evident when using the whole set of features, and can deal with large datasets, as each agent will process a small subset of patterns. First results obtained considering a synthetic dataset are encouraging, suggesting future developments.

An evolutionary agents based system for data mining and local metric learning / Giampieri, Mauro; Rizzi, Antonello. - ELETTRONICO. - (2018), pp. 1461-1466. (Intervento presentato al convegno 19th International Conference on Industrial Technology ICIT 2018 tenutosi a Lyon, France nel February 20-22 2018) [10.1109/ICIT.2018.8352396].

An evolutionary agents based system for data mining and local metric learning

Mauro Giampieri;Antonello Rizzi
2018

Abstract

Discovering regularities in Big Data is nowadays a crucial task in many different applications, from bioinformatics to cybersecurity. To this aim, a promising approach consists in performing data clustering with Local Metric Learning, i.e. trying to discover well-formed (compact and populated) clusters and, at the same time, a suitable subset of features corresponding to the subspace where each cluster lies. In this paper, we propose a novel evolutionary agent based clustering algorithm, where agents act as individuals of an evolving population, each one performing a simple cluster analysis task on a different subset of patterns randomly drawn from the entire dataset. A customized genetic algorithm orchestrates the evolution of a population of such agents. This approach is able to reveal clusters that are not evident when using the whole set of features, and can deal with large datasets, as each agent will process a small subset of patterns. First results obtained considering a synthetic dataset are encouraging, suggesting future developments.
2018
19th International Conference on Industrial Technology ICIT 2018
big data mining; agent based clustering; local metric learning; genetic algorithms
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
An evolutionary agents based system for data mining and local metric learning / Giampieri, Mauro; Rizzi, Antonello. - ELETTRONICO. - (2018), pp. 1461-1466. (Intervento presentato al convegno 19th International Conference on Industrial Technology ICIT 2018 tenutosi a Lyon, France nel February 20-22 2018) [10.1109/ICIT.2018.8352396].
File allegati a questo prodotto
File Dimensione Formato  
Giampieri_Evolutionary_2018.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 560.54 kB
Formato Adobe PDF
560.54 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1034874
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 1
social impact